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ABSTRACT 

We use a sample of galaxies from the Two Micron All Sky Survey (2MASS) Extended Source Catalog 
to refine a matched filter method of finding galaxy clusters that takes into account each galaxy's position, 
magnitude, and redshift if available. The matched filter postulates a radial density profile, luminosity function, 
and line-of-sight velocity distribution for cluster galaxies. We use this method to search for clusters in the 
galaxy catalog, which is complete to an extinction-corrected Zf-band magnitude of 13.25 and has spectroscopic 
redshifts for roughly 40% of the galaxies, including nearly all brighter than K = 1 1 .25. We then use a stacking 
analysis to determine the average luminosity function, radial distribution, and velocity distribution of cluster 
galaxies in several richness classes, and use the results to update the parameters of the matched filter before 
repeating the cluster search. We also investigate the correlations between a cluster's richness and its velocity 
dispersion and core radius, using these relations to refine priors that are applied during the cluster search 
process. After the second cluster search iteration, we repeat the stacking analysis. We find a cluster galaxy 
luminosity function that fits a Schechter form, with parameters Mk* -5\ogh = -23.64 ±0.04 and a = —1.07 ± 
0.03. We can achieve a slightly better fit to our luminosity function by adding a Gaussian component on the 
bright end to represent the brightest cluster galaxy (BCG) population. The radial number density profile of 
galaxies closely matches a projected Navarro-Frenk-White (NFW) profile at intermediate radii, with deviations 
at small radii due to well-known cluster centering issues and outside the virial radius due to correlated structure. 
The velocity distributions are Gaussian in shape, with velocity dispersions that correlate strongly with richness. 

Subject headings: cosmology:theory - large-scale structure of the Universe 



1. INTRODUCTION 



As the most massive structures known that are in dynami 
cal equilibrium, clusters of ga laxies are useful for studies of 
large-scale structure (e.g., [Bahcall|1988 Einasto et al.|2001~ 



Yang et aT|2005[ |Papovic h 2008 ), as we 1 as for galaxy for- 
mation "and evolutiolT7 v e~gT^|DressleF |Goto et aT 



2003) and for constraini ng cosmological parameters (e.g., 
Henry 2000} |Allen et al |2008| jRozo et al.|20T0| . The prob- 
lem of finding clusters of galaxies has been attacked from sev- 
eral angles. The oldest method is simply to look for overden- 
sities in the two-dimensional distribution of gala xies on th e 
sky (e.g., |Abell| 19581 |Zwicky et al.|1968[|Abell et al.|1989) . 
This method faces difficulties caused by line-of-sight interlop- 
ers, a problem which has been greatly ameliorated in recent 
years by photometric (so-called 2.5-dimensional) and spec- 
troscopic redshift surveys. Several algorithms have been used 
to analyze such data, incl uding the percolation (or friends-of- 
friend s) algorithm (e.g., |Huchra & G eller 1982; Crook et"aX] 
2007)1, the red se quence method (e.g.,|Gladders & Yee |2000[ 
Koester et al. 2007 ) a nd the matched fi lter method (e.g. ,|Post>1 
man et al ||1996| JKepner et al ||1999| IKochanek et al.||2003[ 
Dong et al. 2008 Szabo et al.|20ll) . Other fruitful strategies 



include searching for the thermal X-ray emission of the hot 
intra-cluster gas (e.g.,|Gioia et al.| 1990; Ebeling et al.|1998} 
Bohri nger et al.||2001| |Mullis et al.||2003| l and se arching for 
the weakTe nsing signature of clust ers (e .g., Schneider |1996| 
Wittman et al.|2001[|Sheldon et al.|2009[ ). The most recently- 
developed approach is to search for the thermal Sunyaev- 
Zeldovich decrement in the cosm ic microwave background 
caused by this same gas (e.g.,|Carlstrom et al.|2000[|LaRoque 
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|et al.||2003] |Staniszewski et al.|[2009) i. These methods can 
be used to estimate cluster masses, which are important for 
comparison with theory. But optical and infrared (IR) surveys 
primarily measure cluster richness, a quantity which, though 
correlated with cluster mass, has significant scatter at fixed 



mass. 



In th is paper we follow up the work of Kochanek et al. 
(2003 K03 hereafter), who use a matched filter algorithm 



based largely on the earlier approach of Ke pner et al.| ( |1999] l 
to find clusters of galaxies in the Two Micron All Sky Survey 
(2MASS) Exten ded Source Catalog ( [Jarrett et al.^OOOllSkru^] 
skie et al. 2006 ). This method makes use of the expected prop- 
erties of galaxy clusters — specifically, their shapes in angu- 
lar and redshift space, and the luminosity function of their 
members. Our aim in this paper is to evaluate and update the 
parameters of the matched filter in order to increase the com- 
pleteness and purity of the resulting cluster catalog, and to 
obtain richness estimates that are as accurate as possible. The 
large number of clusters in the 2MASS catalog enable us to 
use a stacking analysis to find their average properties. We use 
an iterative strategy: first, we search for clusters using a filter 
very similar to that of K03, then we stack the resulting clus- 
ters to determine their average radial density profile, velocity 
distribution, and luminosity function as a function of richness. 
Using these properties, we then refine the matched filter and 
repeat the cluster search and stacking analysis, adopting the 
results of this second iteration as our best estimate of the av- 
erage properties of the cluster sample. Throughout this work 
we refer to these as the first and second cluster search itera- 
tions. 

K03 used a few modifications to their likelihood function 
to stabilize parameter estimation and reflect prior knowledge; 
we incorporate and expand these. In particular, one set of pri- 
ors makes use of the relationship between a cluster's richness 
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FIG. 1 . — Apparent magnitude distribution of all input galaxies (gray) and 
input galaxies with spectroscopic redshift measurements (black). 

and its core radius and velocity dispersion. Since our clus- 
ter search method produces estimates of these quantities for 
each cluster, we use the correlations we observe in the first 
cluster search iteration to tune this set of priors for the second 
iteration. 

We use a deeper version of the galaxy catalog used by K03: 
a flux-limited selection from the 2MASS Extended Source 
Catalog with K < 13.25 and Galactic latitudes \b\ > 6°. The 
20 mag arcsec -1 isophotal /T-band magnitudes are co rrected 
for Galactic extinction using the Schleg el et al.| ( [T9 98 ) extinc- 
tion model. Of our sample of 380360 galaxies, 161030 have 
spectroscopic redshifts fr om a prehminary v ersion of 2MASS 

The 2MRS con- 



Redshift Survey (2MRS, |Huchra et al.|20lT 
tains not only redshifts measured by |Huchra et al.| ( |201 l| l, but 
also compiles redshifts from many other sources. Out of about 
950, some of the most important contribut ors are the Six - 
degree Field Galaxy Redshift Survey (e.g. 



, Jones et al. 

the Sloan Digital Sky Survey (e.g., Abazajian et al. 



2004) 
2005) 



the Two-de gree Field Galaxy Redshift Survey (e.g., |Colless 



et al.|200T), the Center for Astrophys ics Redshift Survey (e.g 
Huchr a et al.|1983| | Falco et al.|1999l, the L as Campanas Red- 
shift Survey (LCRST ]5nectman et al 11996) ), the ESO Nearby 
Abell Cluster Su rvey QKatgert et al.|1998[ ), the Souther n Sky 
Redshift Survey (|da Costa et al.|1998[), and others (e.g., Bot- 
tinelli et al.jl990[|Loveday et al.|19 96). The complete list of 
sources is given by Huchra et al.|( |201 1[ ). Essentially all of the 
23046 galaxies with K < 1 1 .25 have redshift measurements, 
as do 138157 galaxies fainter than that limit. The apparent 
magnitude distribution of our galaxy sample is shown in Fig- 
ure [TJ In our second cluster search iteration, we find 7624 
cluster candidates that exceed our likelihood threshold, most 
with redshifts below 0.1. The clusters are essentially charac- 
terized by the intrinsic richness A/*, as well as by an "apparent 
richness" N,,, which declines with distance as galaxies below 
the apparent magnitude limit drop out of the galaxy catalog. 
We show the distribution of the cluster candidates in redshift 
and richness in Figure [2] Together with the likelihood val- 
ues of individual clusters, the completeness and purity of the 
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FIG. 2. — Richness N t and redshift z of the cluster candidates resulting 
from our second cluster search iteration. The black points are those clusters 
for which ^,666 > 3, a sample which K03 found to be highly pure. The black 
histograms correspond to these clusters, while the gray histograms represent 
all clusters with A In C > 5 . The search is deliberately extended to low A In £ 
and N f to allow for later investigation of false positives. 

cluster catalog decline as N g decreases. They, as well as mass 
estimates and comparisons to other cluster surveys, are the 
subject of a forthcoming paper. 

In Section [2] we describe our galaxy cluster model, which 
includes the spatial and velocity distribution of cluster galax- 
ies and their luminosity function. We use this model as a 
matched filter to find clusters. In Section [3] we describe 
the stacking technique with which we determine the average 
properties of the clusters in order to update the model. In Sec- 
tion!? we describe the priors that we use the modify the likeli- 
hoodfunction, and use the initial sample of clusters to update 
some of their parameters. Finally, in Section |5]we conclude. 
The work described in K03 used a cosmological model with 
J7m = 1 and H,\ = 0. The details of the cosmological model are 
not very important for our purposes since the 2MASS galax- 
ies are nearby, but for this work we use a cosmology with 
VLm = 0.3 and fl\ = 0.7. We use the usual parameterization 
for the Hubble constant, with H Q = lOOh km s" 1 Mpc -1 . 

2. THE MATCHED FILTER METHOD 



Following the method of Kepner et al. ( 1999 1 as expanded 
by K03, we search for clusters by computing a likelihood 
which is the convolution of the observed distribution of galax- 
ies (in angular, redshift, and luminosity space) with a fil- 
ter tuned to match the "shape" of galaxy clusters. This 
convolution smooths out the small-scale details of individ- 
ual galaxy locations but maximizes the signal due to actual 
cluster-shaped galaxy overdensities. This method has several 
advantages. It provides estimates for the likelihood of each 
detected cluster, as well as for the membership probabilities 
of each nearby galaxy. In its K03 realization, it also provides 
best-fit values and uncertainties for cluster properties such as 
richness and velocity dispersion, and it is flexible enough to 
handle galaxies with or without redshift and color informa- 
tion; this is important for our sample, where not every galaxy 
has a redshift measurement. 
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2. 1 . The Cluster Model 

We add clusters to the catalog in an iterative fashion, at each 
step evaluating the likelihood function at the position of each 
galaxy in the sample and adding a cluster centered on the 
galaxy with the highest likelihood. The likelihood function 
is constructed from the probabilities of the nearby galaxies 
to be cluster members or non-member field galaxies; these 
probabilities make up the matched filter. For a proposed n c -th 
galaxy cluster, the change in likelihood is 



A In C(n c ) = -N* n A nc + V" In 



(i) 



where the sum in i is over all galaxies within the sampling 
radius /? samp = 1.0 h~ l Mpc. The term -N*„ c A„ c is an estimate 
of the total number of member galaxies we expect to be visi- 
ble within R < R sdmp of the proposed cluster, given the survey 
magnitude limit; we give an expression for it in Equation dT4b. 
The definitions of Pf and P c are given in Equations (|2]) andfRji, 
respectively. The expression for the likelihood in E quation (fill 
is derived in Appendix C2 of Kepner et al. ( 1999 ). The likeli- 
hood depends on the richness, redshift, core radius, and veloc- 
ity dispersion of the candidate cluster, mostly through P c \ we 
optimized these values using the Markov Chain Monte Carlo 
(MCMC) method while evaluating the likelihood. The clus- 
ters labeled k = 1 • • • n c — 1 are already in the cluster catalog, 
and have fixed properties. We account for previously found 
clusters via their inclusion in the denominator of the last term 
in the equation; this effectively removes them from the den- 
sity field in a manner similar to the Clean algorithm of radio 
astronomy (Hogbom 1974). We stop iterating when no cluster 
is found that increases the likelihood by more than a predeter- 
mined cutoff value. We choose Aln£ = 5 as our cutoff; this is 
low enough that most of the low-likelihood cluster candidates 
are in fact false positives (see K03). In Section [3] we select 
a relatively pure subset of clusters for analysis, all of which 
have likelihoods well above the cutoff. The likelihood of a 
given cluster correlates well with AL, the number of member 
galaxies brighter than the survey limit, and only weakly with 
the actual richness N* . 

The probability of finding a field galaxy with a given ab- 
solute K magnitude Mk and redshift z in some infinitesimal 
portion of the sky is 



P f = 0.4 ln( 1 0)Dp(z) 4>f(M]c) 



(2) 



where Dq is the comoving distance and 0/ is the field galaxy 
luminosity function. When the redshift of the galaxy is not 
known, we average Pj over the range < z < 1 ; this is effec- 
tively the differential number count of the 2MASS surv ey. We 
follow K03 i n adopting the luminosity function of Kochanek 
et al.| ( 2001| l for </>/. It is a Schechter function (Schechter 
19/bp , with parameters Mk* =-23.39 and a = -1.09, and nor- 
malization n* = 1.16 x I0~ 2 h 3 Mpc" 3 . We also use this lumi- 
nosity function for cluster galaxies in our first search iteration; 
see Equation (Bl). In the second iteration we update the clus- 
ter luminosity function, but the field luminosity function stays 
unchanged. Following the example of K03, we calculate the 
absolute magnitudes using an effective distance modulus 



V(z) = K-M K ee 51og(D L (z)/10 pc)+k(z) 



(3) 



that includes a ^-correction k(z) = -6 log( 1 + z) in addition to 
the term containing the luminosity distance Di{z). As K03 



note, this ^-correction is negative, independent of galaxy type, 
and valid for z < 0.25. Due to our parameterization of Hub- 
ble's constant, we report values of M/j-51og/i, but hereafter 
we omit the second term for the sake of brevity. 

The model for the distribution of galaxies with absolute 
/T-band magnitudes M K , projected radii R, and measured red- 
shifts z relative to a cluster at redshift z c with richness N*, ve- 
locity dispersion er c , and scale radius r c (or alternatively, the 
probability of the cluster having a member galaxy with these 
characteristics) is 



dN 



d 2 xdMfcdz 
c(M K ) 



--N, 



exp 



cHz-ZcY 
'loMX+Zc) 1 



(4) 



The normalization N* is the number of cluster galaxies 
brighter than Mk* within a spherical radius r out of the clus- 
ter center. This radius ought to be roughly similar to the virial 
radius. Like most previous matched filter studies, we choose 
r out = 1 .0/T 1 Mpc. Using a fixed physical radius is convenient 
for calculations, and avoids adding extra scatter to richness 
estimates via the use of a noisy virial radius esti mate. We 
discuss other possible richness measures in Section 2.2 The 
integrated luminosity function $ C (M^) is the spatial density 
of galaxies brighter than Mk, and is the cumulative integral 
of the cluster luminosity function c/) c {Mk)- The function £(/?) 
represents the two-dimensional spatial distribution of galax- 
ies, and is given by a projected version of the Navarro-Frenk- 
White (NFW, |Navarro et al.|1997) profile. Finally, the Gaus- 
sian factor in redshift z represents the line of sight velocity 
distribution of the cluster galaxies. We describe these compo- 
nents in greater detail in the following paragraphs. 

The luminosity function of cluster galaxies is assumed to 
be given by a Schechter function with fixed parameters a and 
M K *, 



e (MK) = 0.41n(10K 



Lk* 



exp 



Lk* 



(5) 



where M K -M K * = -2 .5log(L K / L K *)- The integrated lumi- 
nosity function, which describes the density of cluster galax- 
ies brighter than a specified magnitude, is thus the incomplete 
Gamma function 



Mr 



c {M)dM = n*T l+a 



(6) 



Since <f) c only appears in our calculations as a fraction of 
$ c , its normalization constant is unimportant. For our 
first cluster finding ite ration, we follow K03 in adopting the 
Kochanek et al. ( 20Q1| > luminosity function and set a and Mk* 
to -1.09 and -23.39 mags, respectively. Part of the purpose 
of this work is to verify the appropriateness of this choice for 
the cluster galaxy luminosity function, and we update these 
parameter values in our second iteration. 

We model the angular distribution of cluster galaxies as the 
two-dimensional projection of an NFW profile. For a clus- 
ter with a scale radius r c , this distribution, normalized by the 
number of galaxies within an outer radius Cr c , is given in three 
dimensions by 



P(r) = 



1 



1 



4irr 3 c F(C) x(l +x) 2 



(7) 
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where x = r/r c and F(x) = ln(l +x)—x/(l+x). The parameter 
C is similar to the concentration of the cluster. For the purpose 
of normalizing this profile (or equivalently, defining N*), we 
fix it to C = (1 .0/T 1 Mpc) /r c . The projected profile is 



£(*) = 



f(R/r c ) 
2ir^F(C) ' 



where 



(x 2 -!) 1 / 2 



tan 



x-l 
x+\ 



1/2' 



(8) 



(9) 



and for x < 1 we use the identity -/tan (ix) = tanh (x). Fi- 
nally, the number of galaxies enclosed within a circular radius 
R is given by N(< R) = g(R/r c )/F(C), where 



* W = ln © + (x 



(x 2 -l)'/2 



tan 



x-l 
x+1 



1/2 



(10) 



(Bartelmann 19961. Apart from its normalization, this density 
profile model has a single parameter: the core radius r c . We 
allow it to vary when calculating likelihoods in order to max- 
imize the likelihood (subject to some priors; see Section [4l. 
The "concentration" parameter C varies with it, being defined 
as (1.0/T 1 Mpc)/r c . This is in contrast to K03, who fix r c and 
C to values of 0.2 h Mpc and 4, respectively. 

The final factor in the cluster model is a Gaussian function 
in the galaxy redshifts. Its parameters are the cluster redshift 
z c and velocity dispersion <j c , both of which we vary in order 
to maximize the likelihood. For galaxies without a redshift 
measurement, the cluster probability P c is integrated over all 
possible galaxy redshifts, turning this factor into unity. 

Part of the goal of this work is to tune the parameters of the 
matched filter that we use to find galaxy clusters. We address 
the following points: 

• The suitability of the projected NFW profile at a range 
of scales. 

• Whether the velocity distribution of cluster galaxies is 
consistent with a Gaussian, and the richness-dependent 
width of the distribution. 

• The luminosity function parameters a and Mk*, as well 
as the necessity of an extra component on the bright end 
to account for the presence of a brightest cluster galaxy 
(BCG) population. 

2.2. Derived Quantities and Richness Transformations 

Our primary richness measurement is AT*, defined as the 
number of galaxies brighter than L + within a fixed radius 
r < '"out = I -Oh' 1 Mpc. Though convenient for our purposes, 
it is not easy to compare to more theoretically motivated in- 
dicators, which are usually defined with respect to a virial ra- 
dius within which the average density is some multiple of the 
background. To address this issue, we define a second indi- 
cator iV*666 as the number of L > galaxies within a virial 
radius r^. We calculate this radius by numerically solving 
the equation 

47T 

N* An =N*F(r A Jr c )/F(C)=—n*A N r 3 AN r(l + a,l) (11) 

for r& N , with the number overdensity Ajv set to 666, corre- 
sponding (for unit bias and VLm = 0.3) to a mass overdensity 




FIG. 3. — Richness estimate ^,666 versus AT* for clusters with ^,666 > 3 
and Aln£ > 5 found in the second cluster search iteration. The solid curve 
is the best-fit power law. On the right axis we show a rough estimate of the 
cluster mass M2 00, as estima ted f rom iV*666 in the X-ray stacking analysis of 
|Dai et al.|(2007) See SectionfO] 

relative to the critical density = A^CIm = 200. The den- 
sity n* is taken from the field galaxy luminosity function cj)f, 
and a is the slope of the cluster luminosity function cj> c . It 
is worth noting that aside from a relatively unimportant de- 
pendence on a, ./V*666 is completely determined by AT* and r c . 
The two richness indicators are highly correlated, and their 
values from the second cluster search iteration are plotted 
against each other in Figure [3] The best-fit relationship be- 
tween them is log JV* 666 = (-0. 34 ± 0.0 1 ) + ( 1 .43 ± 0.03) log N* , 
and we measure a scatter in log(Af*666) of 0.009 at fixed N*. 
|Dai et al7| ( |2007[ ) explore the relationship between A^*666 and 
M200, the mass of clusters measured within the standard radius 
^200 where the overdensity is 200 times the critical density. 
They find that they two quantities are nearly linearly related, 
with 



log AT* 6 66 = 

(1.10 ± 0.04) + (0.87 ± 0.05) log 



Af 200^70 

10 146 M ( 







(12) 



To give a rough idea of the range of masses of our cluster 
sample, we show this relationship in Figure [3] The plotted 
values take into account their use of hjo, as well as the differ- 
ence between our second-iteration A^*666 and that of Dai et al. 
(2007) due to the change in the cluster luminosity function. 
We discuss this difference in a few paragraphs. 

The number of galaxies actually detected in a cluster of a 
given richness controls its observability. This is a distance- 
dependent quantity: for a nearby cluster we detect many faint 
galaxies that appear brighter than our magnitude limit, while 
an equally rich distant cluster will manifest only its brightest 
few galaxies. We define A^ g 666 as the number of cluster galax- 
ies within r < r^ brighter than the survey magnitude limit: 



N g 666 - Af*666 



r(l+a,Li im (z c )/L*) 
r(l+a,l) 



(13) 



where Li; m (z c ) is the galaxy luminosity corresponding to the 
survey limit at redshift z c . A related quantity appears in the 
first term in Equation ([T]). The number of galaxies within the 
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circular sampling radius /? samp brighter than the limiting mag- 
nitude is given by 

A (Z)= z—^r ^7— (14) 



F(C) 



r(l + a,l) 



for a cluster of richness A 7 * = 1, where L]j m is defined as in 



Equation ( 13 1, and g(x) is defined in Equation ( 10 1. 

Finally, in some cases we are interested in the probability 
that a certain galaxy is a cluster member. Despite its simi- 
lar name, this is not the same as the probability of finding a 
galaxy near a cluster P c . For a galaxy labeled i near a cluster 
labeled k, we define its membership probability as 



raemb 



(»,*) = 



Pc(i,k) 



(15) 



where Pj and P c are defined in Equations Q and (Eji, the and 
sum is over all clusters. T his definition is nearly faentical to 
that of Ro zo et aX] ( |2009| l, except that they consider only the 
j = k term of the sum. 

There is a subtle but important difference between the val- 
ues of and AT* 666 that we find in our first cluster iteration 
and those we find in the second iteration. This is because we 
change the parameters of the luminosity function between the 
iterations, most importantly the cutoff magnitude Mk* ■ Since 
A 7 * is defined as the number of galaxies brighter than Mk*, we 
change this count when we change M%* ■ In Section |3.3| we 
discuss the change in the matched filter luminosity function 
between the first and second search iterations. As the clus- 
ter search algorithm varies A 7 * to maximize the likelihood, it 
is effectively fitting the matched filter luminosity function to 
the observed galaxy distribution by varying its normalization. 
Therefore, our best estimate of the transformation between 
A 7 *! from the first cluster search iteration and A 7 * 2 from the 
second is the ratio of normalizations of the Schechter func- 
tions that best fit the stacked clust er luminosity function. We 
perform these fits in Section 3.31 The normalization of the 



second-iteration luminosity function is a factor of 0.838 lower 
than that of the first iteration; therefore, A 7 * 2 ~ 0.838A 7 *! for 
any given cluster. With the (reasonable) assumption that r c 
does not depend on the shape of the luminosity function, the 
virial richness AT* 666 varies the same way. The transformation 
of ATg666 is more complicated; it involves not only the ratio of 
normalizations but the fraction of galaxies brighter than the 
survey limit. The ratio of the second-iteration A^666 to the 
first-iteration one is 



0.838 



r(l+a 2 ,L lim /L. 2 )r(l+ai,l) 

r(i+ c^Ljto/L*!) r(i+o 2 ,i) ' 



where the subscripts 1 and 2 refer to the first and second 
iterations, and Ln m is the limiting luminosity as in Equa- 
tion (TT3J. This value is redshift-dependent, varying from 
0.868 at z c = 0.01 to 1.328 at z c = 0.1. It is unity at a red- 
shift of 0.047, and since this is a fairly typical redshift for our 
clusters we adopt this value, so that there is no change in A^666 
between iterations. We tested these ratios by matching a sub- 
set of our second-iteration cluster catalog to our first-iteration 
catalog, and found them to be correct predictions. Through- 
out this paper, we report the values of and A^666 calculated 
by our algorithm without applying the correction factor, but 
we caution the reader to take care in comparing their values. 
In some cases we use the factor to define subsets of the cluster 
catalogs that should be roughly equivalent between iterations, 



and we note our use of the transformation factors in those in- 
stances. 

3. CHARACTERISTICS OF STACKED CLUSTERS 

We cannot characterize the radial profile, luminosity func- 
tion, or velocity distribution of individual clusters in any de- 
tail, because the typical cluster contains too few galaxies. But 
by "stacking" large number of clusters, we can determine their 
average properties. Methods simi lar to this have been used 
in several studies (e.g.,[C arlberg et al. 1996, |Dai et al.|2007| 
|Rykoff et al.|2008[[Rbzo et al.|2010p . We apply the process to 
a sample of clusters with likelihoods A In C > 5 and expected 
galaxy number A^666 > 3. K03 found that this last require- 
ment resulted in an extremely pure sample of clusters. 

The stacking process consists of selecting the galaxies 
within some physical radius of the cluster center and con- 
structing a histogram of the relevent quantity (i.e., radial posi- 
tion for a radial profile or absolute magnitude for a luminosity 
function), subtracting an appropriate background (estimated 
using the entire galaxy sample), and averaging the results for 
the sample of clusters. We estimate the uncertainties in our 
averages using bootstrap resampling, an approach which we 
advocate for future studies. We resample both the cluster and 
galaxy lists with replacement to construct the bootstrap uncer- 
tainties. In all cases we resample 100 times. 

Before stacking, we separate our cluster catalog into rich- 
ness bins. We divide the space between AT*666 = 0.1 and 
A^666 = 30 into five logarithmic bins 0.5 dex wide. For the first 
cluster catalog (i.e., the catalog resulting from the first clus- 
ter search iteration), this division excludes five clusters with 
AT* 666 < 0.1 and two with AT*666 > 30, leaving a total of mem- 
bership of 58, 365, 596, 425, and 85 clusters in the five bins, in 
increasing order of richness. In the second cluster catalog, we 
multiply the bin edges by a factor of 0.838 to account for the 
fact that the richness measurements are systema tical ly lower 
than they were in the first iteration (see Section |2~2| ). These 
five bins contain 46, 450, 867, 594, and 125 clusters. We 
further subdivide these bins into membership samples with 
3 < N gm < 5, 5 < Ngm < 7, 7 < N g6b6 < 10, and Ngm > 10 
detected galaxies. This is to check whether there is any bias 
in our estimates of the cluster properties with the number of 
detected galaxies. Since the number of detected galaxies de- 
pends primarily upon redshift, changes between these sub- 
samples could also indicate evolution of average cluster prop- 
erties, but at the low redshifts of our clusters we expect little 
evolution. 

It is desirable to use a richness estimator with a s mall scatter 
relative to the actu al cluster richness (e.g., Rozo et aL|2009| 
Rykof f et al.|2012 i. Although A 7 * 666 is useful for comparison 
with theoretical studies, it is possible that the nature of its def- 
inition (i.e., with respect to other noisy quantities) makes it a 
noisier estimator than the more simply defined A 7 * . To check 
this, we divide the clusters resulting from our second clus- 
ter search iteration into bins of N*, matching the bin edges 
to those of our previous bins using the bes t-fit relation be- 
tween A 7 * and AT*666 described in Section [2~2| We examine the 
uncertainties in our average profiles, velocity distributions, 
and luminosity functions resulting from the bootstrap resam- 
pling, comparing them to those from the AT* 666 binned sam- 
ples. There are no significant differences in the size of the 
error bars, so we cannot conclude on those grounds that N* 
is a better richness estimator than AT*666- Given the extremely 
tight correlation between A 7 * and A^666, this is not surprising; 
the richness bins contain nearly identical sets of clusters in 



6 



BLACKBURNE & KOCHANEK 



both cases. 

3.1. Density Profile 

We first compare the average surface density distribution 
to the projected NFW we use for the matched filter. 
We measure the density profiles in a series of logarithmically 



spaced annuli with inner and outer edges R m j < Rj < R, 



centered on Rj = (Ri n .jR out .j) ' , and with area Aj = n(R^ at j- 
Rf n j). The expected number of background galaxies in an an- 
nulus is 

bj = 2TTB[(l + Rl J /D A (z) 2 Y 1/2 -( l+R L.j/DA(z) 2 y m 

~ BA j / Da(z) 2 , (16) 

where B is the angular surface density of galaxies to the sur- 
vey magnitude limit (by "background", we mean a combi- 
nation of foreground and background galaxies). The higher- 
order terms in the curved-sky Taylor expansion become im- 
portant when we examine nearby clusters at large radii. If we 
count Nj galaxies within the j'-th annulus, the surface density 
profile of the cluster, scaled by its richness jV*, is 



£(*,) = 



1 N, 



*666 



g666 



Nj-bj 



(17) 



The quantity {Nj — bj)/Aj is the surface number density of 
cluster galaxies, after subtracting the mean background bj. 
The factor N^^/Ng^ converts the observed number of 
galaxies (brighter than the survey magnitude limit) to the av- 
erage number of L > galaxies; see Equation ( |13) , We aver- 
age this estimate of the surface density over all clusters in each 
richness and membership bin. At small and medium scales, 
all the clusters in the sample contribute, but at the largest 
scales the number of contributing clusters declines slightly 
because a few clusters overlap the Galactic latitude bound- 
ary \b\ > 6.0°. We also calculate the average AT* and r c for the 
clusters in each richness bin. 

Figure|4]shows the projected density profiles of the stacked 
clusters from the first and second cluster search iterations for 
each richness bin, including all clusters with at least N<,(,66 > 3 
member galaxies and Aln£ > 5. In each bin, the profile has 
been multiplied by the average AT* of the bin. We superpose 
the projected NFW profile £(R) that was used to find the clus- 
ters, calculated using the average N* and r c in each bin. In 
the second panel we also show an ad-hoc profile for the two- 
halo contribution suggested by the observed average profiles 
(more on that in a few paragraphs). It is important to note that 
the model profiles are not fits to the galaxy profiles, but are 
simply the matched filter model calculated using the average 
cluster properties. We note three distinct radial regimes in the 
profiles. 

First, at small radii (R < 0.1 h~ l Mpc) the profiles are sen- 
sitive to the method used to center the clusters. The difficulty 
of determining the central galaxy density profile of clusters 
due to the method used to center the cluster is well known 



(e.g., [Beers & Tonry|1986| l. The most obvious sign we see is 
a distinct density spike in the innermost bin; this is due to our 
practice of always centering a cluster on a galaxy. We avoid 
"smearing" the contribution of the central galaxy over the re- 
gion with R < r c , but there will always be problems recon- 
structing a possibly singular average central density profile 
in the presence of the shot noise from the galaxies sampling 
the profile. We experimented with using a cluster position 



10 < M.666 < 30 

• 3 < AU66 < 10 

■ 1 < < 3 

A 0.3 < AU 6 6 < 1 

O 0.1 <AU66 <0.3 




FIG. 4. — The projected distributions of galaxies for clusters from the first 
(top panel) and second (bottom panel) cluster search iterations. They are 
divided into richness classes from low (open circles) to high (filled circles) 
richness; note that the sec ond it eration bin edges are smaller than those of the 
first iteration (see Section |2~2) . This figure uses the full sample of clusters 
with A^666 > 3 (i.e., the union of the membership classes). The points are the 
observed distributions, with their bootstrap uncertainties. The central density 
spike is an artifact resulting from our practice of centering every cluster on 
a galaxy. The first panel shows the projected NFW model. In the second 
panel, the NFW and two-halo profiles are shown in dotted lines, while their 
sum is the solid line. Only the NFW component was used in the matched 
filter, which extends to 1.0 h Mpc. Note that we can measure the cluster 
profile out to 20 Mpc from the center of all but the poorest clusters. The 
mean background of unrelated galaxies has been subtracted. 



estimated by averaging the membership probability weighted 
positions of the cluster galaxies. While this eliminated the 
central spike, the profile shape began to depend on the num- 
ber of member galaxies Ng&&. Essentially, we measure the 
true average profile convolved with the position measurement 
errors, and these increase considerably as the number of mem- 
ber galaxies available for the average decreases. This problem 



has been seen by JDai et alT] ( [2007] ) andjRykoff et al. (2008), 
who both stack X-ray images of optically-selection clusters. 
Despite these problems, our profiles match the expected shape 
reasonably well apart from the innermost radial bin. 

Second, on intermediate scales (0.1/T 1 Mpc < R < 1.6/T 1 
Mpc), which dominate our detection and parameter estima- 
tion, the average surface density of the galaxies matches rea- 
sonbly well the profile shape expected from our matched filter. 
The observed and NFW surface density profiles have formal 
chi-square differences, for 13 degrees of freedom, of 9.0 to 
46 for the first iteration and 2.8 to 42 for the second iteration. 
These large values arise largely from ~ 10% normalization 
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differences between the data and the model (which was cal- 
culated with no free parameters); these differences are unim- 
portant for the matched filter, where the normalization is free 
to vary. 

Third, on large scales (R > 1.6/T 1 Mpc), the observed sur- 
face density lies above the projected NFW profile. The NFW 
model represents only the virialized regions of the cluster and 
neglects the correlated structure outside the virial radius — 
the "two-halo" term, in the parlance of th e halo model of large 
scale structure (e.g., |Coora y & Sheth 2002). This outer region 
of clusters should provide a good testing ground for the halo 
model because it represents the dividing line between linear 
and nonlinear regimes. We create a crude model for the pro- 
file of the two-halo term consisting of a broken power law: 



3 < iV s 666 < 5 



i + RyRj h f M/2 



(18) 



where fl2h(^V*) is a richness-dependent normalization constant 
for the two-halo term and R 2 h is a break radius. The inner 
and outer slopes and (3 out we set to 2 and 0.8 respectively; 
the former because it made the enclosed mass an analytic 
function, and the latter by analogy with the slope of typi- 
cal correlation functions on these scales. We adjust the nor- 
malization and scale radius until they roughly match the ob- 
served second-iteration profile, finding that fl2h(^V*) = (0. 17 ± 
0.01)M a33±0 04) /j 2 Mpc" 2 and that R 2h = (1.66 ± O.IO)^ 1 
Mpc. We do not add the two-halo term to the matched filter 
in the second cluster search iteration, because its contribution 
inside 1.0 A -1 Mpc (the sampling radius) is very small relative 
to the NFW component. 

For second-iteration clusters, we repeat the stacking pro- 
cess for the membership subsamples of each richness bin in 
order to check for evolution of the cluster profiles with the 
number of detected galaxies. The resulting richness estimates 
are shown in Figure [5] We plot only samples where the prod- 
uct of the number ofclusters and the average value of N g (,6(, 
is greater than 30; this excludes three low-richness samples. 
On scales less than ~ 1 h~ l Mpc, we do not see any evidence 
for changes in the profile with the number of visible galaxies, 
unless the profile is very noisy. 

3.2. Velocity Distribution 

Our cluster search algorithm produces two estimates of the 
velocity dispersion of each cluster. The first estimate is the 
value of a c used in the matched filter. We maximize the likeli- 
hood for each cluster by varying the velocity dispersion using 
MCMC optimization, subject to our priors. We also compute 
the velocity dispersion using galaxies with redshift measure- 
ments within Cr c of the center of each cluster, weighting the 
contribution of each galaxy by its cluster membership prob- 
ability f m emb (see Equation (fl3]l). We average the latter ve- 
locity dispersion estimate in each richness bin, and use this 
average to define our model in the following analysis. 

As with the cluster density profiles, the typical cluster con- 
tains too few galaxies to accurately estimate the velocity dis- 
tribution in a non-parametric way. So we again stack large 
numbers of clusters in fixed richness bins to determine the 
average velocity distribution. We first identify all galaxies 
with measured redshifts within the projected virial radius, 
R < 7"666, of each cluster, so as to compare velocity histograms 
inside the virialized region for clusters of differing richness. 
We discard those galaxies which lack redshift measurements; 




10° 
R (hT 1 Mpc) 



FIG. 5. — Projected cluster profiles for cluster subsamples with different 
numbers of detectable galaxies JV ? 666- The clusters with the fewest visible 
galaxies are in the top panel, and the number increases toward the bottom 
panel. The points' colors and shapes indicate richness bin, as in Figure[4] and 
the solid curves show the sum of the NFW and two-halo term. We do not see 
strong evidence for evolution in the profile with A^666- 

this should be relatively unbiased with respect to the veloc- 
ity distribution since any target selection method for mea- 
suring redshifts is unbiased with respect to the relevant ve- 
locity differences. We then construct histograms of the rest 
frame line of sight velocities (relative to the systemic veloc- 
ity) Av = (v-cz c )/(l +z c ), considering only velocities where 
|Av| < 2000 km s" 1 . We use variable bin widths of 50, 75, 
100, 150, and 250 km s" 1 for the five richness bins. Finally, 
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Av (km s" 1 ) 



FIG. 6. — The rest-frame line of sight velocity distribution of the first- 
iteration (first panel) and second-iteration (second panel) cluster galaxies in 
the five richness bins. The points show the average velocity distributions, 
with the same colors and shapes as in Figure [4] and the solid curves are the 
Gaussian models used by the matched filter. 

we average the histograms over the clusters in each richness 
bin, excluding clusters with N v < 5 galaxies with measured 
redshifts and clusters with virial radii extending into the re- 
gion where \b\ < 6°. The requirement of five redshift mea- 
surements limits the effect of the sample variance bias. We 
also calculate the average richness and velocity dispersion of 
the clusters in each richness bin, weighted by N v —l. 

Figure [6] shows the velocity distributions measured in this 
way for the first and second cluster search iterations, for the 
same richness bins we used for our profile measurement. Su- 
perposed on the distributions are Gaussian curves with widths 
set by the average velocity dispersion in the bins. The nor- 
malization of the curves is arbitrary, and they are adjusted to 
match the observed distributions. We find excellent agree- 
ment between the predicted curve and the observed distribu- 
tion. Figure [7] shows the velocity distributions as a function 
of visible galaxy number N g ^. We impose the same cut as 
was previously used to weed out three low-richness samples. 
No significant evolution of the velocity distributions with the 
number of detected galaxies A^666 can be seen. Although the 
velocity distributions look somewhat narrow in the lowest- 
membership sample, we think this is due to the low number 
of galaxies per cluster. 

One advantage of the stacked clusters is that they contain 
many galaxies, so it is easy to estimate a Gaussian width with- 
out using a statistical method that is dominated by the tails of 
the distribution. We clip the velocity distributions at twice the 




Av (km s" 1 ) 



FIG. 7. — Velocity distributions for cluster subsamples with different num- 
bers of detectable galaxies N g (,66, which increases from the first frame to the 
last. The points' colors and shapes indicate richness bin, as in Figure|6] and 
the solid curves are identical to those in that figure's second panel. We see no 
strong evidence for evolution of the velocity distributions with N g f,66 ■ 

average velocity dispersion for each richness bin. We then 
sort the velocity differences and estimate the velocity disper- 
sion as one-half the velocity range encompassing 68.3% of 
the galaxies centered on the median. Like a simple velocity 
dispersion, this estimate is exact for a Gaussian distribution, 
but it is insensitive to interloping field galaxies. The resulting 
velocity dispersion estimates are reported, along with the dis- 
persions determined by averaging over each richness bin, in 
Table Q] The differences between the two estimates are less 
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TABLE 1 
Velocity Dispersions 



Richness" 


<cr r ) (kms-') b 


CTsort (km s 1 ) c 


0.1<JV»666 <0.3 


197 


190 


0.3 < JV, 666 < 1 


261 


250 


1 < #.666 < 3 


357 


338 


3 < JV.666 < 10 


561 


531 


10 < W, 666 < 30 


862 


821 



a Though we do not show it here, the bin edges have 
been reduced by a factor of 0.838 because W*666 is sys- 
tematically smal ler f or the second cluster search itera- 
tion (see Section [2~2) . 
b Average velocity dispersion in the bin. 
c Velocity d isper sion estimated by sorting the velocities 
(see Section|3/2l. 

than 10%. 

3.3. Luminosity Function 

The final distribution we consider is the luminosity function 
of the cluster galaxies. Since clusters at different distances 
have different galaxy luminosities corresponding to the survey 
magnitude limit, some care is required in combining clusters 
within a richness bin. We first construct a histogram of the 
absolute magnitudes of galaxies within R ydmp = 1.0 h~ l Mpc 
of each cluster center, with bins 1/3 mag in width. Each his- 
togram extends to the faintest bin whose faint edge is brighter 
than the survey limit. In this same set of magnitude bins we 
calculate the "background" of interloping galaxies by taking 
the distribution of apparent magnitudes of our galaxy sam- 
ple, shifting this distribution by the distance modulus X>(z c ), 
and scaling the resulting distribution by the ratio of the angu- 
lar area of the cluster to that of the whole survey. We sub- 
tract this background from the luminosity function and scale 
the difference by the factor N^g(R smnv /r c )/F{C) to account 
for our cylindrical sampling volume. Finally, we average the 
result over all the clusters in each richness bin, weighting the 
clusters by this same factor. The number of clusters contribut- 
ing to the luminosity function estimate is a strong function of 
absolute magnitude; the faintest bins only have contributions 
from the nearest clusters, while the brightest bins average over 
tens to hundreds of clusters, depending on the richness and 
membership bin. As before, we exclude clusters with sam- 
pling radii extending into the excluded region of low Galactic 
latitude where \b\ < 6°. 

Figure [8] shows the observed cluster galaxy luminosity 
function for clusters from our first and second search iter- 
ations. In each case, the luminosity function used for the 
matched filter is plotted as a short-dashed curve, with a nor- 
malization adjusted to best fit the observed data. The curve 
in the first panel clearly lies below the data, especially at the 
bright end. So we fit a second Schechter function to the data; 
this is plotted as a dotted curve. This function was character- 
ized by M Kt = -23.59 ± 0.05 mags and a = -1.08 ± 0.03. We 
used this function for the second matched filter; it is therefore 
the short-dashed curve in the second panel. It closely matches 
the best-fit Schechter function for that panel, which has pa- 
rameters M K * = -23.64 ±0.05 mags and a = -1.07 ±0.03. 
This fit is good, producing a chi-square of 169.6 for 130 de- 
grees of freedom. 

There has been considerable interest in the luminosity fuc- 
tion of BCGs and whether they l ie on the same Schechter 
function as satellite galaxies (e.g., Yang et al .|2008 1. In Fig- 
ure [8] we see a small excess at the bright end of the lumi- 




FlG. 8. — The cluster luminosity function for first-iteration (first panel) 
and second-iteration (second panel) clusters in the five richness bins. The 
points show the measured luminosity functions, normalized to unit N* . In 
each panel, the short-dashed curve signifies the Schechter function used in 
the matched filter, and the dotted curve is the best-fit Schechter function. 
The solid curves indicate the best-fit sum of Schechter and BCG compo- 
nents, which are also shown separately as long-dashed lines in the second 
panel. The slanted lines show the level of foreground/background contami- 
nation that is subtracted to make the estimate of the luminosity function for 
each richness class; the contrast of clusters relative to this contamination de- 
pends strongly on richness. At very high and very low luminosities, the fore- 
ground/background subtraction fails. 



nosity function relative to the best-fit (dotted) Schechter func- 
tion, suggesting that we are seeing a separate population of 
BCGs. We parameterize this population using a Gaussian lu- 
minosity distribution in magnitude space. To investigate the 
BCG population, we attempt to identify in each cluster the 
galaxy with the greatest probability of being the BCG. The 
BCG probability is the product of the galaxy's cluster mem- 
bership probability, defined in Equation ( |15} , and the prob- 
ability that the galaxy's luminosity is drawn from the Gaus- 
sian BCG distribution. For the clusters found in the first it- 
eration, we made a guess at this distribution, centering the 
Gaussian at -25.1 mags, with a variance of (0.4 mags) 2 . 
The exact parameters of this distribution are not very im- 
portant; its main function is to help us rank the galaxies by 
BCG probability. We exclude any galaxy that lacks a red- 
shift measurement, has membership probability P mem b < 0.5, 
or is farther than R samp = 1.0 /T 1 Mpc from the cluster cen- 
ter. Out of our sample of 1532 clusters with N g (,(,6 > 3, this 
process yields BCGs for 1507. The magnitude distribution of 
these galaxies is close to Gaussian in shape, and has mean 
-24.61 mags and variance (0.56 mags) 2 . We adopt these 
parameters for the BCG luminosity distribution, and repeat 
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our fit of the luminosity function with this Gaussian term 
included. We find best fit values of M K * = -23.22 ± 0.07 
and a = -0.90 ± 0.05, with the BCG distribution peaking at 
(0.045 ±Q.Q03)4> C (M K #)- This two-part luminosity function is 
shown as a solid curve in the first panel of Figure [8] The best- 
fit luminosity function for the second-iteration cluster sam- 
ple is nearly identical, with Mk* = -23.27 ±0.07 mags and 
a = -0.89 ±0.05. The BCG curve is centered on -24.57 mags, 
with a variance of (0.59 mags) 2 , and reaches a maximum of 
(0.048 ±O.OO9)0 e (M^). This function is likewise plotted as 
a solid curve in the second panel of Figure IS] and its com- 
ponents are shown as long-dashed curves. This fit is slightly 
better than the Schechter-only fit, with a chi-square of 141.6 
for 129 degrees of freedom. 

The luminosity function calculated using the membership 
subsamples is shown in Figure [9] In each panel we also plot 
a solid curve showing the same best-fit luminosity function 
as is shown in the second panel of Figure [8] We again see 
no strong evidence of evolution with the number of observed 
cluster members. 

Our luminosity function results broadly agree with previous 
estimates for the IR luminosity functions of cluster galaxies. 
For example, |Balogh et al.| ( |2001| l estimate the /f-band lu- 
minosity function using 2MASS galaxies matched to groups 
and clusters in the LCRS. For groups (a < 400 km/s, or 
N*666 < 3), they find estimates of M K * = -23.58 ±0.13 and 
a = -1.14 ±0.26, whereas for clusters (a > 400 km/s, or 
N*666 > 3) the y find M g „ = -2 3.81 ± 0.40 and a = -1.30 ± 
0.43. Similarly, |Lin et alT] ( |2.004] > examine X-ray selected clus- 
ters in 2MASS data, finding -1.1 < a < -0.84 and -24.57 < 
M K * < -23.25. In their study of clusters from the Cana- 



dian N etwork for Observational Cos mology survey (fYee et al. 
|1996| l at a median redshift of z = 0.3, |Muzzin et al.| ( |20T)7| > find 
M K * =-23.76 ±0.15 an d a = -0.84 ±0.08 . We have adjusted 
t he va lues of Mk* from |Lin et al7| ( |2004| l and |Muzzin et al.| 
(2007) to account for their use of h = 0.7. 

4. PRIORS 

As alluded to in previous sections, we make a number of ad- 
justments to the likelihood function in Equation (fTJ in order 
to stabilize parameter estimation and reflect prior knowledge. 
Each modification takes the form of an additive term, usu- 
ally the logarithm of a multiplicative prior probability distri- 
bution. The exact formulas for these additions, labeled Prior I 
through Prior VI, are listed in Table [2] with their parameters 
listed in Table |3l The first prior imposes a prior on the rich- 
ness N* with a slope of -2, and turning over at N* = 0.1, the 
expected richness of the poorest groups. This enforces a rea- 
sonable mass function for clusters, with a power-law slope. 
Prior II makes the likelihood independent of a c when the can- 
didate cluster contains only one galaxy with a redshift mea- 
surement, removing a bias in the optimization of this param- 
eter's value. Prior III reflects empirical relationships between 
N* and the parameters a c and r c , imposing a Gaussian cen- 
tered on a power-law relationship between the richness and 
the parameter, with a width designed to match the scatter in 
the parameter at fixed richness. In using these priors, we are 
following K03 (although they do not include the prior on the 
N* — r c relationship, because unlike us they fix r c = 0.2 h~ l 
Mpc for all clusters). We also include three priors not used 
by K03, labeled Priors IV through VI. The first of these is 
very similar to Prior I; it requires clusters with large veloc- 
ity dispersions and core radii to be rare, taking the power-law 
slope of the mass function and of the mass-observable relation 




FIG. 9. — Cluster luminosity function for cluster subsamples with different 
numbers of detectable galaxies ^666. which increases from the first frame to 
the last. The points' colors and shapes indicate richness bin, as in Figure[8] 
and the solid curves are identical to those in that figure's second panel. We 
see no strong evidence for evolution of the luminosity function with ^666 ■ 



as parameters. Prior V imposes a Fermi function cutoff on the 
parameters <r c and r c , ruling out very small values. Finally, 
Prior VI puts a Gaussian prior on the difference between the 
line of sight velocity of the central galaxy and that of the clus- 
ter, and specifies that the central galaxy ought to be around L* 
or brighter. 

Some of the prior parameters listed in Table [3] are simply 
constant offsets to the likelihood, and are thus unimportant 
except insofar as they may admit or exclude clusters with 
likelihoods near the cutoff value. The parameters a[f , a ! c v , 
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TABLE 2 
Priors 



Label 


Formula 


Comments 




— lntl -t-fA/ /N 1 V^l 


Cluster mass function 


Prior II 




Unbiased estimate of a c 


Prior III 


-1/2 [logfe/of )-o™ log(iV*)] 2 /(a',") 2 
-1/2 [log(r c /rf )-af log(^)l7(«f f 


Empirical N* — a c con'elation 
Empirical JV* — r c correlation 


Prior IV 


+[a CT (l-7)-l]ln( t r c /0 
+[a r (l- 7 )-l]ln(r c /rf) 


Mass function and M c \ — a c relation 
Mass function and M c \ — r c relation 


Prior V 


-lnd+expK^-a^/A^]) 
-ln(l+exp[(^-r f )/A^]) 


Fermi function enforcing a c > a v c 
Fermi function enforcing r c > rjf 


Prior VI 


-\/2(v pec /v v p ' ec f 
-2.5a v '\og(\+L t /L cen ) 


Peculiar velocity of central galaxy 
Brightness of central galaxy 



TABLE 3 
Prior parameter values 



Label 


Parameter 


Iter. 1 Value 


Iter. 2 Value 


Prior I 


Nl 


0.1 


0.1 


Prior II 


4' 


1000 km s~' 


1000 km s~ J 


Prior III 


a 1 " 
Jn 

a{" 
al" 

4" 

' c 


0.526 
0.0744 
0.114 
0.071 

254 km s-' 
0.231 Mpc 


0.445 
0.153 
0.155 
0.037 
250 km s" 1 
0.203 Mpc 


Prior IV 


7 

<*a 
Otr 

°y 

r' c V 


1.8 
2.0 
2.25 
800 km s" 1 
0.2 Mpc 


1.8 
2.0 
2.25 
800 km s" 1 
0.2 Mpc 


Prior V 


r? 

Ar* 


200 km s' 1 
10 km s _1 
0.1 Mpc 
0.01 Mpc 


200 km s" 1 
10 km s" 1 
0.1 Mpc 
0.01 Mpc 


Prior VI 


v VJ 

' pec 

a vl 


200 km s" 1 
1.0 


200 km s" 1 
1.0 
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and r' c v fall in this category. We set them to nominal values, 
adopting K03's value for a'J . For Priors I and III, we use 
the same parameter values as K03 for the first cluster search 
iteration, making a guess for the values for the relationship 
between Af* and r c . For Priors IV and V we use theoreti- 
cally motivated guesses at the parameter values. In particular, 
Prior IV encodes the expectation tha t dN/dM oc M" 7 and that 
M oc af a oc r" r . ^Tinker et al. ( |2008[ l parameterize the cluster 
mass function using a more complicated functional form, but 
we can approximate it as a power law of slope 7 = -1 .8. 

Our third prior reflects empirical relationships between 
cluster richness and other properties. We examine the correla- 
tions between these properties in subsets of the clusters found 
in our first cluster search iteration, in order to update the pa- 
rameters of this prior for the second iteration. We turn first 
to the Af* - a c correlation. For this test we used a subset of 
902 clusters with likelihoods Aln£ > 20, expected number 
of visible galaxies A^666 > 3, and measured velocity disper- 
sions (i.e., we excluded clusters with too few redshift mea- 
surements). The top panel of Figure [10] shows the distribution 
of these clusters. We find a strong correlation, with a Pear- 
son coefficient r = 0.75. The prior used in the first iteration 
is plotted with its lcr contours, and matches the observed dis- 
tribution well. We also plot a fit to the data, with lines in- 
dicating the scatter about the fit. The best-fit relationship is 
log(cr c ) = (2.36 ±0.05) + (0.445 ±0.088) log(A^), with a scat- 




FIG. 10. — Velocity dispersion a c (first panel) and core radius r c (second 
panel) versus richness A/,, for selections of clusters from the first cluster 
search iteration. The dashed lines indicate the relationship used in Prior III 
during the first search iteration, with its lcr width, and the solid lines indicate 
the relationshipand width used during the second iteration. See Section|4] as 
well as Tables[2]and[3] 

ter of 0.153 dex. The updated parameters for Prior III that 
result from this fit are listed in Table [3] Because the meaning 
of Af* changes between iter ation s, we use the transformation 
factor calculated in Section |2T2| to express the empirical rela- 
tionships in terms of the second-iteration richness. This sim- 
ply results in a slightly different offset to the relation; these 
offsets are reflected in the values of er™ and r^ 1 reported in 
Table [3] When we examine the same relationship after the 
secondcluster finding iteration, the best-fit relationship and 
the scatter are essentially unchanged. 

We examine next the correlation between Af* and r c , using 
the same subsample of clusters as before but also including 
27 clusters with unmeasured velocity dispersions. As seen in 
the bottom panel of Figure 
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there is again a strong cor- 
relation (Pearson r = 0.77). — We plot the prior used in the 
first cluster-finding iteration. The observed correlation is off- 
set from the prior relation, and has a scatter smaller than the 
width of the prior. We find that if we adjust the prior in the 
second search iteration to match this empirical relationship, a 
similar offset and further reduced scatter are apparent in the 
second-iteration clusters. This indicates that the galaxy distri- 
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bution is not constraining r c for individual clusters, but that it 
is being determined predominantly by the priors (particularly 
by the combination of Prior III and Prior IV, which favors 
smaller values of r c ). This is not particularly surprising, since 
the detailed radial density profile of a single cluster cannot 
be well constrained unless a great many member galaxies are 
detected. Therefore we abandon the strategy of determining 
the parameters of Prior III using individual r c va lues and turn 
to the stacked profiles described in Section [XT] and shown in 
the first panel of Figure [4] We fit the projected NFW profile 
from Equation [10] to the stacked profiles in each of the five 
richness bins, varying the normalization and the scale radius 
r c . We restrict our fits to radii smaller than 1 ft -1 Mpc, and ig- 
nore the innermost radial bin, which is biased high because of 
the central galaxy in each cluster. We use the resulting values 
and uncertainties in r c to perform a power-law fit for the re- 
lationship between (iV*) (the average AT* in each richness bin) 
and r c . We obtain a logarithmic slope of 0.155 ±0.068 and 
normalization (i.e., r c for N* = 3) of 0.235 ±0.012. Since this 
relationship is determined using the stacked profiles of actual 
clusters, it is not directly affected by Prior III; nevertheless, 
it does not differ wildly from the first-iteration relationship. 
In our second cluster search iteration, we update the parame- 
ters of Prior III to these best-fit values (see Table [3J. We set 
the width of the prior a 1 " to three times the uncertainty in the 
normalization, to account for the additional uncertainty in the 
slope. 

5. CONCLUSIONS 

We follow up on the search for galaxy clusters in the 
2MASS catalog described by K03, using an iterative process 
to check and adjust several of the adjustable aspects of the al- 
gorthm. The most important component of the search process 
is the matched filter itself, that is the description of the char- 
acteristics of clusters. We check the projected density pro- 
file shape and velocity distribution of clusters, and the lumi- 
nosity function of their galaxies by stacking clusters in bins 
of richness. Overall, we find that the cluster model used by 
K03 is mostly accurate, with radial profiles closely matching 
the projected NFW model at radii less than 1 h~ l Mpc and 
velocity distributions matching the expected Gaussian distri- 
butions very well. At large radii, out to ~ 20/T 1 Mpc, the 
observed density profile lies above the projected NFW pro- 
file. We attribute the excess density to correlated structure and 
construct a toy profile to fit this "two-halo" term, but because 
our matched filter only searches for galaxies within 1 h~ l Mpc 
we do not bother to add it to our matched filter. The main dis- 
crepancy between the stacked clusters and the matched filter 
that was used to find them is in the luminosity function, which 
we find to underestimate the fraction of bright galaxies. After 
updating the matched filter with the best-fit Schechter func- 
tion, we find that the second-iteration clusters match the fil- 
ter well. The best-fit function has M K * = -23.64 ±0.04 and 
a = -1.07 ±0.03. Though a single Schechter function fits 
the data reasonably well, we find a slightly better fit when 
we add a Gaussian component at the bright end, suggest- 
ing that a separate population of BCGs is present. Based on 
our best guess of the BCGs in a subset of our clusters, we 
estimate that the Gaussian is centered at a K magnitude of 
-24.57, with a variance of (0.59 mag) 2 , including this com- 
ponent causes the Schechter parameters to change consider- 
ably; their new best-fit values are Mk* = -23.27 ±0.07 and 
a = -0.89 ±0.05. The Gaussian BCG component peaks at a 



value of (0.048 ±O.OO9)0(Afe). We do not find that the av- 
erage profiles, velocity distributions, or luminosity functions 
of clusters varied with 2V g 666> the (distance-dependent) num- 
ber of cluster galaxies that we expect to be brighter than the 
survey limit. 

We also update the priors that are added to the likelihood 
function. We include the three priors used by K03, including 
one (labeled Prior III) which takes into account the empiri- 
cal relationship between richness and velocity dispersion and 
adding one for the analogous relationship between richness 
and core radius. We also add three more priors in an effort 
to improve the completeness and purity characteristics of the 
cluster sample. The first of these, which we label Prior IV, 
discourages clusters with large velocity dispersions and core 
radii, as they are associated with (rare) massive clusters. An- 
other puts lower limits on the values that these variables can 
take, and the last puts a prior on the central galaxies of clus- 
ters, encouraging them to be brighter than and to have 
small peculiar velocities. We use the clusters found in our 
first cluster search iteration to tune the empirical relationships 
on which Prior III is based for the second iteration. 

After adjusting the parameters of the matched filter and the 
priors, we repeat our search for clusters. The richness and 
redshift distributions of the resulting sample of 7624 cluster 
candidates with Aln£ > 5 (2087 with A^ g 666 > 3) are shown 
in Figure [2] This is a larger sample than the 5793 (1532 
with iVg666 > 3) found in the first iteration. When we repeat 
the stacking analysis on this second catalog, we find that the 
shape of the filter is well-matched to the average properties 
of the clusters. We further subdivide the clusters into mem- 
bership samples with different values of iV g 666 m order to test 
for evolution in the cluster parameters with the number of de- 
tected cluster galaxies, and find no strong evidence for such 
evolution. 

We mention in closing two technical points. First, we have 
introduced the general approach for studying clusters of "cat- 
alog bootstrap resampling". By simultaneously resampling 
both the cluster catalog and the galaxies, we can include many 
statistical uncertainties in a well-defined manner. This boot- 
strap approach could also be applied to the galaxy catalog dur- 
ing the process of finding clusters, where the variance in the 
resulting cluster catalogs and properties would probe many, 
though not all, of the systematic problems associated with 
identifying clusters, and would yield meaningful constraints 
on the purity of the catalog. The one operational issue is that 
repeated galaxies should be spatially shifted away from one 
another in order to avoid overly artificial density spikes, but 
not by so much that density profiles are overly smoothed. This 
suggests scales of order r c , but tests in artificial catalogs can 
be used to test and optimize this scheme. Second, while in this 
work we have stacked clusters in order to "manually" adapt 
the matched filter used to find clusters, the process could in 
principle be automated. For instance, once an initial catalog 
is found, one could adjust the parameters of the matched fil- 
ter to maximize the overall likelihood value based on the fixed 
properties of that cluster catalog. A new catalog could then be 
constructed using that updated matched filter. Note that this 
is still an iterative process; it is doubtful that an attempt to 
simultaneously find clusters and optimize the matched filter 
would be numerically stable. 
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